USFD: a unified storage framework for SOAR HPC scientific workflows

Authors

  • Grant Mackey
  • Saba Sehrish
  • Christopher Mitchell
  • John Bent
  • Jun Wang
Abstract

Emerging scientific workflows in HPC focus more on analysis than on simulation. Simulation output is so dense with information that a single output must undergo copious analysis before the results of that simulation are understood. We identify this repetitive analysis as a new application type, Simulate Once Analyze Repeatedly (SOAR) computing. Current scientific HPC practice, when extended to SOAR computing, results in excessive data migration between compute and storage resources. For a workflow bound by file I/O, a large data migration overhead is unacceptable. We propose a framework, called USFD, which couples a data-intensive storage cluster with an interoperability layer. USFD is a Unified Storage Framework designed to better support SOAR HPC scientific workloads through enhanced file I/O support and co-located storage and analysis. In this work we analyze the performance of USFD and of traditional HPC approaches for SOAR scientific workloads. Our results show that SOAR workflows which use USFD complete analysis with a 7.5x performance increase over other approaches with QCD and a 4x performance increase with FLASH.

Similar resources

Performance Characterization of Scientific Workflows for the Optimal Use of Burst Buffers

Scientific discoveries are increasingly dependent upon the analysis of large volumes of data from observations and simulations of complex phenomena. Scientists compose the complex analyses as workflows and execute them on large-scale HPC systems. The workflow structures are in contrast with monolithic single simulations that have often been the primary use case on HPC systems. Simultaneously, ne...

Scientific workflow orchestration interoperating HTC and HPC resources

In this work we describe our developments towards the provision of a unified access method to different types of computing infrastructures at the interoperation level. For that, we have developed a middleware suite which bridges two non-interoperable middleware stacks used for building distributed computing infrastructures, UNICORE and gLite. Our solution allows to transparently access and operate o...

Dataflow-Based Scheduling for Scientific Workflows in HPC with Storage Constraints

In high-performance computing (HPC), workflow-based workloads are usually data intensive for exploratory analysis of a scientific computation problem that may involve a large parameter space. To achieve the best performance, storage resource constraint is always a pragmatic concern in reality as the potential problem space scale, especially in big data science, as well as its required dataset a...

Mero: Co-Designing an Object Store for Extreme Scale

Within the HPC community, there is consensus that Exascale computing will be plagued with issues related to data I/O performance and data storage infrastructure reliability, caused primarily by the growing gap between compute and storage performance, and the ever increasing volumes of data generated by scientific simulations, instruments and sensors. The architectural assumptions for extreme co...

A Unified Approach for Modeling and Optimization of Energy, Makespan and Reliability for Scientific Workflows on Large-Scale Computing Infrastructures

Green computing has received significant attention in the past few years. Although some research has addressed cooling and energy usage reduction in large data centers [1], it does not control how resources are used by applications. Scientific workflows are a useful representation for managing the execution of large-scale computations on high performance computing (HPC) and high throughput comp...

Journal title:
  • IJPEDS

Volume 27  Issue 

Pages  -

Publication date 2012